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C I We have analyzed the cross-correlations of daily fluctuations for A^ 

■ 6 358 US stock prices during the year 1999. From those TV (A^ — l)/2 cor- 

relations coefficients, the Minimum Spanning Tree (MST) has been 
> _ 
•/^ I built. We have investigated the topology exhibited by the MST. 

^2^1 ' Eventhough the average topological number is (?i) « 2, the variance a 
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of the topological distribution f{n) diverges! More precisely, we have 
found that f{n) ~ 7i^^-^ holding over two decades. We have studied the 
topological correlations for neighbouring nodes: an extremely broad 



i-rt ■ set of local configurations exists, confirming the divergence of a. 
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In a series of works M, EJ , Mantegna and coworkers have studied the Mini- 
mum Spanning Tree (MST) of financial data, mainly the cross-correlations be- 
tween N different stocks. Building a MST is a common method in spin glasses 
for putting into evidence the primary structure of a complex system with a non- 
trivial dynamics 0]. The MST puts into evidence the cooperative behaviours 
in the stock market. 

Herein, we report some analysis of the topology exhibited by the MST. Due 
to Initial Public Offerings (IPOs) and mergers, it is difficult to perform some 
MST-analysis over long periods. The number of companies is indeed always 
evolving. We focussed our work on A^ = 6 358 US companies during the year 
1999. Those stocks are all traded on the Nasdaq, NYSE and AMEX places. 
Cross-correlations py for pairs ij of stocks are computed as 

^(< K'>~< Y, >2)(< Y^^>-< Y, >2) 
where the F's are the daily fluctuations of the log of the stock prices. The MST 
has been built following the Kruskal's algorithm in order to find out the A^— 1 
most important correlated pairs of stocks among the N{N — l)/2 = 20 208 903 
possible pairs! Figure 1 presents three parts of the MST: AMZN (Amazon.com 
Inc.), YHOO (Yahoo! Inc.), KO (Coca-Cola Company) and their nearest neigh- 
bours. AMZN is connected to EBAY (eBay Inc.), CMGI (CMGI Inc.) and 
SOFN (SoftNet Systems Inc.). YHOO is connected to ARBA (Ariba Inc.) and 
also connected to CMGI such that YHOO is the next nearest neighbour of 
AMZN. KO is connected to PEP (PepsiCo, Inc.), CCE (Coca-Cola Enterprises, 
Inc.) and AVP (Avon Products Inc.). In addition to KO, AVP is connected to 
42 others stocks! 

One can observe a clustering of companies being in the same sector such 



as energy, technology, internet, transportation, food and cosmetics, etc. The 
clustering in various sectors, each being a branch of the MST, has been pre- 
viously observed in |l|. In addition to the sectorization, we have observed a 
trend for a global clustering of stocks traded on different market places. Most 
Nasdaq stocks are located around the QQQ ticker (Nasdaq Composite ticker), 
and largest stocks of NYSE are located around DIA (Dow Jones ticker) . 

One could distinguish three types of topological configurations for compa- 
nies: (i) important nodes, (ii) links, and (iii) dangling ends. The important 
nodes (like AVP) should correspond to companies controlling or mediating the 
fluctuations of their neighbourhood. The links (like YHOO) are mediating the 
information (fluctuation) along the branches. The dangling ends (like PEP) are 
supposed to be the less influent companies, or to be the less influenced by the 
market fluctuations (like indices). 

From the mathematical point of view, we can associate a "topological num- 
ber" 1 < 71 < {N — 1) at each node of the tree. The topological number n is 
defined as the number of connections starting from the considered site, i.e. the 
coordination number of that site. The statistics of the tree is thus determined 
by the distribution of topological numbers f{n). The mean topological number 
is given by 

(n) = ^n/(n). (2) 

n=l 

If one multiplies both sides of this equation by N, the summation becomes equiv- 
alent to count twice all the A^ — 1 connections along the tree. As a consequence, 

one has 

A^- 1 
in) = 2^^ (3) 

which is approximately equal to 2 when the number of sites A^ becomes large, 



i.e. in our case. The above mathematical property is quite general and holds for 
all types of trees. In the following, we will consider that (n) = 2. Thus, the first 
moment (n) of / cannot give any information about the topology of trees since 
it does not reflect the shape of the tree. However, the second moment a can be 
quite useful. Let us assume a topological correlation in between neighbouring 
nodes. The Aboav law, a well known law in the Physics of foams Q, stipulates 
that topological correlations can be captured by the following relationship 

mn = A+— (4) 

n 

where rrin is the mean topological number of the nearest neighbours of a node 

characterized by a topological number n. A and B are positive constants to be 

determined. The form of the Aboav law implies that important nodes with large 

n are more favorably connected to dangling ends. Indeed, when the coordination 

number becomes large and reaches n -^ N — I, which correspond to a star 

structure, one should obtain m„ = 1. This leads to the constraint [^ v4 = 1. For 

a random structure, one expects that the statistical average of the product rnrin 

is similar to (n^), one has 

{n) + B^ (nm,,) ^ (n^) = a^ + A (5) 

and thus B = <j^ + 2 and the Aboav law becomes 

a^ + 2 

m„ = IH (6) 

n 

for any kind of tree with a large number N of nodes. The above equation is 

valid in the hypothesis of random trees. 

Here, we have defined some mathematical tools {f{n), a and the Aboav law) 

for describing the topology of trees in full generality. Let us investigate the spe- 

^ This constraint has a counterpart yl = 5 in the case of foams [U . 



cific topology of financial MST. The log-log plot of the topological distribution 
is shown in Figure 2. The distribution f{n) is broad and looks like a power law 

fin) ^ n-" (7) 

holding over two decades with an exponent a = 2.2 ± 0.1. This result means 
that the variance of the f{n) distribution diverges even the mean (n) remains 
finite (see below)! In other words, nodes with high topological numbers are not 
so rare. For random trees, one expects an exponential decay of f{n) and quite 
different statistics. The market is naturally self-organized in a coordination 
invariant structure, i.e. a self-organized critical structure m! One should also 
remark that our results have an important graphical consequence: large n values 
for nodes are not so rare such that it is extremely hard to draw the entire MST. 

The power law (7) and the divergence of a are not numerical artefacts. In- 
deed, we have checked the Aboav-like law for which the variance of the topolog- 
ical distribution is relevant (see above). Figure 3 presents the (to„, -) diagram 
in which the Aboav law should be a line with a slope cr^ + 2. However, the N 
values of the measured ?Ti„ on the MST are sparsely distributed in the diagram 
such that no relevant linear fit can be performed. Eq.(6) is not valid expressing 
that statistical averages cannot be taken as for usual random topological struc- 
tures. We argue that the extremely broad dispersion of local configurations in 
(m„, i) expresses the divergence of a. 

We put into evidence the emergence of some order out of the apparent dis- 
order of the stock markets! Our results are robust. We did the same analysis 
for subsets of the data and we obtained the same behaviour (a power law for 
f{n) and a divergence of a). Also, we did the same for the "anti-tree", i.e. the 
tree for the strongest anticorrelations, and we have found similar results with 



the same exponent a k, 2.2. As found in B, the connections between pairs of 
stocks are mostly unchanged by taking other time periods. Of course, IPOs 
{N -^ N +1) and mergers {N -^ N ~ I) would influence a httle the MST but 
we stress that the topological properties of the MST would not dramatically 
change. 
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Figure Captions 

Figure 1 — Three typical configurations of the nearest neighborhood of 
three companies on the MST: (a) AMZN (Amazon.com Inc.) connected to 
EBAY (eBay Inc.), CMGI (CMGI Inc.) and SOFN (SoftNet Systems Inc.); (b) 
YHOO (Yahoo! Inc.) is connected to ARBA (Ariba Inc.) and CMGI; (c) KO 
(Coca-Cola Company) is connected to PEP (PepsiCo, Inc.), CCE (Coca-Cola 
Enterprises, Inc.) and AVP (Avon Products Inc.). The topological indices n 
are given. 

Figure 2 — Log-log plot of the topological distribution f{n) of the MST. 
The continuous line is a fit with the power law (7). 

Figure 3 — The (m„, -) diagram for the A^ = 6 358 companies. No Aboav 
(linear) behaviour emerges. 
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